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This paper investigates a function of macroscopic variables known as the singular po¬ 
tential, building on previous work by Ball and Majumdar. The singular potential is a 
function of the admissible statistical averages of probability distributions on a state space, 
defined so that it corresponds to the maximum possible entropy given known observed 
statistical averages, although non-classical entropy-like objective functions will also be 
considered. First the set of admissible moments must be established, and under the con¬ 
ditions presented in this work the set is open, bounded and convex allowing a description 
in terms of supporting hyperplanes, which provides estimates on the development of sin¬ 
gularities for related probability distributions. Under appropriate conditions it is shown 
that the singular potential is strictly convex, as differentiable as the microscopic entropy 
and blows up uniformly as the macroscopic variable tends to the boundary of the set of 
admissible moments. Applications of the singular potential are then discussed, and par¬ 
ticular consideration will be given to certain free-energy functionals typical in mean-field 
theory, demonstrating an equivalence between certain microscopic and macroscopic free- 
energy functionals. This allows statements about L 1 -local minimisers of Onsager’s free 
energy to be obtained which cannot be given by two-sided variations, and overcomes the 
need to ensure local minimisers are bounded away from zero and infinity before taking 
bounded variations. The analysis also permits the definition of a dual order parameter 
for which Onsager’s free energy allows an explicit representation. Also the difficulties 
in approximating the singular potential by everywhere defined functions, in particular 
by polynomials, are addressed with examples demonstrating the failure of the Taylor 
approximation to preserve shape properties of the singular potential. 

Keywords : Order parameter constraints; mean-field theory; maximum entropy methods; 
liquid crystals. 
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1. Introduction 

In many-body problems in physics it is often required to reduce the complexity 
of the problem by applying statistical methods. Consider a state space A' with a 
corresponding measure /i, and a probability distribution p on X that describes the 
probability of a given body occupying the state t £ X. Two particular examples are 
firstly nematic liquid crystals, where axially symmetric molecules can be described 
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by their orientation, so that X = § 2 and the Boltzmann equation with state 
space A' = R 3 x R 3 corresponding to the position and momentum of particles 
Thermodynamic equilibria can then be described as minima of free energy func¬ 
tionals on V(X) = {p £ L 1 (X) : f x pdp = l,p > 0 a.e}, the set of probability 
distributions on A. A particularly common example is the mean-field free energy, 
based on the second virial expansion and due to Onsager anc j j s typically given 
by 

1 v (p)=t[ p{t)\np(t)dp{t) - i f ( K(s,t)p(s)p(t)dp(t)dp(s). (1.1) 

J X J X J X 

where the function I\ £ L°°{X x X) is a symmetric positive kernel, T > 0 represents 
temperature and p is a measure on A'. This is analogous to the Helmholtz free 
energy, with the left-hand term representing an entropic contribution and the right- 
hand term representing chemical energy. This work will be concerned with the 
commonly considered case where the kernel K is of the form 

k 

K(s,t) = c ij a i(t)cLj(s) ( 1 . 2 ) 

*>j=i 

for some k £ N, constants Cij , and a set of linearly independent functions a ; G 
L°°(X) for i = 1 The functions cq will often be denoted by a single vector 

valued function, a = € L°°(X, R fc ). An alternative way to approach such 

problems is by considering only macroscopic variables typically called order param¬ 
eters, defined as statistical averages corresponding to the mean held model by 


h= f p(t)ai(t)dp(t). (1.3) 

Jx 

The vector of moments will be denoted b = (6i)i =1 £ R fe . By considering a finite 
dimensional state space of order parameters and disregarding the full statistical 
nature of the problem, analysis becomes much simpler through tools such as the 
Landau expansion (see for example Ref. [29]for a broad review). Within the context 
of the Q-tensor model for nematic liquid crystals, Majumdar noted that by lo sing 
the statistical nature of the problem, physical constraints on bi can be lost ^U. 
For example from the Holder inequality it must hold that \bi\ < Halloo for each i. 
Motivated by the notation used in nematic liquid crystals, define Q to be the set of 
admissible order parameters, 

Q= {(/ Oi(t)p(t)dt) :p£iP(A)| cR fe . (1.4) 

Majumdar investigated conditions under which equilibrium values of the order pa¬ 
rameters are physical, in that they are elements of Q, demonstrating that the Lan¬ 
dau model can fail in this sense. Motivated by this problem Ball and Majumdar 0 
defined a singular potential ip s on Q within the Q-tensor theory of nematics that 
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builds on earlier work by Katriel et. a/P^ The singular potential is a convex func¬ 
tion, inspired by the entropic term in the mean-field free energy, that blows up as 
the order parameters approach the boundary of Q. Whilst the work of Ball and 
Majumdar concentrated on ensuring physicality in static problems relatin g to ne¬ 
matic liquid crystals, it has further been applied to dynamic systemsD- 4 | 31 |, nema tic 
elastomers and the derivation of Q-tensor models demonstrating the ver¬ 
satility of the framework. The aim of this work is to extend the ideas of Ball and 
Majumdar to define a singular potential in a more general setting, as well as analyse 
its properties and develop applications. 

Before the singular potential can be defined the set of admissible moments Q 
must first be understood. This will form the bulk of Section [2] This is an example 
of the problem of moments which in the general case is poorly understood. The 
classical problem considers X = R, and functions Oi(x) = x l , although others have 
considered the more general setting. For example, Lewis has provided a characteri¬ 
sation of the set of moments that roughly corresponds to the existence of solutions 
to certain optimisation problems In Section [2] a more geometric description of 
the set Q will be useful, in which the set is described in terms of supporting hyper¬ 
planes. Whilst this description of Q is abstract and in general is unlikely to give an 
explicit expression for Q it is nonetheless appropriate for the analysis in this work. 
In particular, this characterisation is used in providing growth bounds on the sin¬ 
gular potential. Throughout both Sections [2] and [3] the assumption is made that the 
constraint functions {cti)^ =1 and the constant function ag(t) = 1 form a pseudo-Haar 
set of functions, a property first defined by Borwein and Lewis which provides an 
elegant theory without being overly restrictive to applications in mean-field theory. 

The singular potential is defined and analysed in Section [3] The singular po¬ 
tential corresponds to the greatest possible entropy of a probability distribution, 
subject to observed moments. This is an application of the principle of maximum 
entropy, pioneered in the seminal work of Jaynes which has since been influen¬ 
tial in a vast array of applications. Mathematically it can be phrased as a convex 
optimisation problem for the probability distribution subject to linear constraints. 
For the bulk of this work more general objective functions will be considered than 
the Shannon entropy, which is given by <fi(x) = rrlncc. The objective function will 
be required to share similar properties to the Shannon entropy (see Definition 13.11) . 
One of these properties is that the objective function is not differentiable at 0, which 
combined with the non-negativity constraint p > 0 has a consequence of not per¬ 
mitting two-sided variations about the minimiser p *, unless it is known beforehand 
that ess inf p*(t) > 0. Borwein and Lewis - provide an ideal framework for this kind 

t£X 

of problem, based on duality, that will be exploited in this work. The main results 
of this section are that, under appropriate conditions, the singular potential tps is 
strictly convex, has the same differentiability as the objective function, and that 
tl> s {b) blows up to +00 uniformly as the distance from b to dQ approaches zero. 

Finally in Section []] some potential applications are discussed. The results 
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demonstrate an equivalence between certain minimisation problems in V{X) and Q, 
and demonstrate the possibility of using the singular potential to rephrase harder 
questions relating to functional analysis in terms of simpler questions of several vari¬ 
able calculus. Particular mention is given to the mean-field approximation, where 
in the literature a rigorous treatment of the non-negativity constraint is often ne¬ 
glected, and the results in Section |4] provide an existence proof for minimisers as well 
as demonstrating that the solutions obtained using the first variation are correct. 
Furthermore the equivalence of local/global minimisers of the macroscopic and mi¬ 
croscopic free energies has the implication that under the assumptions presented in 
this work, equilibria of mean-field free energies can be described using only macro¬ 
scopic variables and the optimal entropy assumption. As noted by Decarreau et. al 
LL2J, the dual formulation of the optimisation problem, which rephrases the optimi¬ 
sation problem in terms of finitely many Lagrange multipliers, is the most desirable 
from the point of numerical analysis. The “moment-space” representation outlined 
in this work is loosely speaking equivalent, permits a simpler state space for an¬ 
alytical problems, and also allows the problem to be phrased directly in terms of 
order parameters. Expanding further on the work of Ball and MajumdarSl, models 
with spatial inhomogeneities will be considered and it is shown that for a particular 
class of free-energy functionals, minimisers are strictly physical, in the sense that 
the minimiser is bounded away from dQ. However the class of models in which this 
analysis works is rather limited so that it remains open if more general free energies 
have strictly physical minimisers. 

The remainder of Section [I] discusses issues surrounding the approximation of 
the singular potential by globally defined functions, in particular by polynomials. 
The Landau theory of phase transitions ® states that since the free energy should 
be analytic in the order parameters, one can consider a Taylor expansion of the 
free energy to low (typically fourth) order and perform an energy minimisation 
over the approximation. It is shown that even within some simple one dimensional 
examples, the fourth order Taylor approximation to the singular potential in gen¬ 
eral fails to reproduce various desirable properties of the singular potential such as 
convexity, the existence of a single critical point and even a lower bound. Whilst 
it is possible to approximate the singular potential by convex polynomials due to 
a Weierstrass-type result, constructive methods appear to be out reach. Similarly, 
due to the difficulty of establishing the convexity of fourth order polynomials in sev¬ 
eral variables, even providing conditions for the Taylor approximation to be convex 
app ears to be unobtainable. In response to this, the Yosida-Moreau approximation 
1221 is suggested as a globally defined approximation that preserves meaningful prop¬ 
erties of the singular potential. The drawback however is that the Yosida-Moreau 
approximation is similarly defined through an optimisation procedure that, loosely 
speaking, is no less, although fortunately no more, difficult to solve than the opti¬ 
misation problem that defines the singular potential itself. To illustrate issues with 
approximation and provide a concrete example of the singular potential framework, 
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the McMillan model — for isotropic-nematic-smectic A phase transitions is used as 
a guiding example. The isotropic phase corresponds to a disordered liquid, the ne¬ 
matic phase corresponds to molecules having orientational, but no positional order, 
and the smectic A phase corresponds to molecules having orientational order, as 
well as positional order in one direction, parallel to the molecular orientation. This 
model is described by a mean field free energy, with state space X = § 2 x [0,1], and 
two order parameters (S, a) defined as 


S [ [ p(p,x) (3cos(p • ei) 2 — l) dH 2 (p) dx 
2 Jo Js , 2 

a =- / / p(p, x) (3cos(p • ei) 2 — l) cos( 27 t:e) dH 2 (p) dx. 

2 Jo Js 2 


(1.5) 


Here ei is a unit vector. If S = a = 0 then the system is in the isotropic phase, if 
S ^ 0 and cr = 0 then the system is in the nematic phase, and finally if S ^ 0 and 
<j ^ 0, the system is in the smectic A phase. 


2. The Moment Problem 

The key condition required for the framework presented to be successful is the 
pseudo-Haar property of a finite set of functions. 

Definition 2.1 (Pseudo-Haar functions, see Ref. (8|). Let (X,p) be a finite 
measure space. Let k GN and /* : X — > R, for i = 1,..., k. The set {/, : i = 1,..., k} 
is called pseudo-Haar if for every Y C X with p(Y) > 0, the set {fi\y ‘ i = 1, ..., k} 
is linearly independent. 

Remark 2.1. If (X,p) is a measure space with atoms, so that there exists some 
t G X with p.({f}) > 0, then any set of two or more functions on X cannot be 
pseudo-Haar, since restricted to the set {f} one is simply a multiple of the other. In 
particular the theory presented here is not applicable to discrete state spaces, such 
as A a finite subset of N. 

Lewis showed that for X a subset of K", and (/i)f =1 analytic and linearly 
independent on a connected neighbourhood of X then the functions are pseudo- 
Haar with respect to the Lebesgue measure. This result can be slightly extended 
to manifolds, which has applications in liquid crystal theory, where in particular X 
is taken as § 2 for axially symmetric molecules or SO(3) for molecules with lower 
symmetry. By phrasing the problem in local coordinates, the result of Lewis can 
be used to extend the result to when the domain is a Riemannian manifold, and 
the measure taken will be the one induced by the metric, to be denoted by /i g . The 
result can trivially be extended for any other measure p! such that for all A C X, 
p'(A) = 0 => p g {A) = 0. For a relatively self contained introduction to the theory 
of integration on manifolds, the reader is directed to Ref. [3] Before attempting the 
proof, some preliminaries will be taken covered. 

Definition 2.2. Let be X an analytic manifold with atlas (( U a , 4>a)) a& j, where J 
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is some index set. Say that / : X — > K. is analytic if / o </> a 1 : 4>{U a ) —> K. is analytic 
for all a £ J. 

Proposition 2.1 (Proposition 12.1.6 of Ref. |3|) ■ Let {X, g) be an n-dimensional 
Riemannian manifold with induced measure p g . Then p g {A) = 0 if and only if 
C n ((f>(A D U)) = 0 for all charts (U, < f>) of X. 

Proposition 2.2 (From Ref. Il9j> . Let fi C K" be open and connected, and let 
f : Q —> K be analytic. Then if there exists some set A C Li with £ n (A) > 0 so that 
/U = 0; then / = 0. 

Using these results it is now straightforward to obtain the required result. 

Proposition 2.3. Let (X,g) be a connected, analytic, Riemannian manifold. De¬ 
note by /i the measure on X induced by the metric g. Let fi : X —> R be analytic 
functions on X for i = 1, ..,k for some k £ N. Then the set {fi : i = 1, ,.,fc} is 
linearly independent if and only if it is pseudo-Haar. 


Proof. If the functions are pseudo-Haar then linear independence is immediate. 
To show the converse, assume the result is false. This implies that there exists some 

/c 

£ G with £ ^ 0 and A C X so that fi g (A ) > 0 and £ifi{x) = 0 for all x G A. 

i— 1 
k 

For simplicity denote F = £ifi . Then it is clear that F is analytic, and F\a = 0. 

i= 1 

Without loss of generality take all charts to be connected by splitting the charts 
into connected components where necessary. Using Prop. [2TTl there must exist some 
chart (U, (f) so that C n (<j>(Ar\U)) > 0. For brevity let U = <f(U) and A = (f(Ar\U). 
Also let F = F o q t >~ 1 :[/—>■ R. Since F is analytic, then F is analytic. Therefore 
since = 0 with Hg{A) > 0, and F analytic on the connected open set U, by 
Prop. [2721 it must hold that F = 0 on U. Composing F with f> gives that F = 0 on 

U, which is open. Therefore since X is connected and analytic, F is zero on all of 

k 

X. Recalling the definition of F, this implies ^ U fi = 0, contradicting the linear 

3 = 1 

independence assumption and completing the proof. □ □ 


Definition 2.3 (The sets V{X) and Q). Let (X,p) be a finite measure space. 
Consider finitely many constraint functions cq £ L°°{X) for i = 1 ,...,k, such that 
the set {t e> 1} U {m : i = l,..,fc} is pseudo-Haar. For ease of notation let a = 
(ch)i =1 6 L°°(X, M fc ). Define the sets 


V{X) = G L x { X ) : p > 0 /i-a.e., J pdg = 1 j , 
j [ apdg: p£P(X)\ cR k . 


( 2 . 1 ) 


Q = 


Furthermore, if b £ Q, then b will be called physical, and if b = f x pa dp, then b is 
generated by p. 
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Definition 2.4. Let u £ S k 1 . Given the constraint functions (aj)^ =1 with a; £ 
L°°(X ) for i = 1, k, define € M by 

5 U = ess sup u ■ a(t). (2.2) 

te A' 

Let e > 0. Dehne the set E™ C X as 

E™ = {t €: X : S u < u, ■ a(t) + e} . (2.3) 

Note that since p{X) < oo, L°°{X) C L 2 (X), so that up to an invertible linear 
transformation, it can be assumed that the functions (aj)f =1 are L 2 (X) orthonor¬ 
mal functions, so that j x a^aj dp = Sij, and orthogonal to any constant so that 
f x a,i d/j, = 0. For simplicity of calculation, this will be assumed unless stated other¬ 
wise, although it has little effect on the results since the non-normalised framework 
is equivalent up to an affine map on Q. This has the consequence that the uni¬ 
form distribution, pu(t) = jpjq, satisfies f x pu(t)a(t) dp(t) = 0. This is relevant 
in Landau theory where the order parameters are required to be zero in the high 
temperature uniform state. In order to establish a necessary and sufficient condition 
for b £ Q, some properties of Q will be established. The method is to describe Q in 
terms of supporting hyperplanes. 

Proposition 2.4. The set Q is convex, bounded and open. 


Proof. Convexity is immediate from the convexity of V(X). To show boundedness, 
let b e Q, corresponding to some p £ V{X). Then 


\b\ = 


[ a(t)p(t) dp,{t) 
Jx 


< ess sup |a(t)| 
tex 


(2.4) 


so that Q is bounded. To see that Q is open, consider some b £ Q and let £ £ R fc . 
Since the functions 1 ^ 1 and ai are pseudo-Haar, by Ref. (7 there exists some 
p £ V{X) that generates b and e > 0 such that p(t) > e > 0 almost everywhere. 
Define the function £ L°°(X) by 


k 

xd*) = '52 a i(t)ti- 

i— 1 


(2.5) 


Then f x dp = 0 and f x a(t)xj(f) dp(t) = £ due to the orthogonality of (aj)f =1 

and the constant function. Note that |xf (t)| = |£ • a(t)\ < |£| ess sup |a(s)|. Hence if 

sex 


lfl< 


e 

ess sup |a(s)| ’ 
sex 


( 2 . 6 ) 


then p^ = p+Xe. is non-negative almost everywhere. Combining the previous results, 
this gives that p$ £ V(X), and J x a(t)p^(t) dp(t) = b + £, so that Q is open. □ 
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Using these properties it is now possible to characterise Q in terms of its sup¬ 
porting hyperplanes, and this will provide the necessary and sufficient condition for 
b £Q. 

Theorem 2.1. Let b £ R fe . Then b £ Q if and only if 

b ■ u < esssupa(f) ■ u = S u (2.7) 

tex 

for all u £ § fc_1 . 

Proof. By Ref. [20 it is known that a closed convex set K C K fc can be written as 
the intersection of all closed half spaces containing K , so that x £ K is equivalent 
to 


x ■ u < sup z • u (2-8) 

z&K 

for all u £ § fc_1 . If H C R fc is a bounded open convex set, then by considering 
K = LI it is straightforward to deduce that x £ 12 is equivalent to 

x ■ u < sup z ■ u = sup z ■ u (2-9) 

zGK zGfi 

for all u £ § fc_1 , since a non-zero linear function cannot attain its maximum over a 
set in the interior. By taking 12 = Q, the supremum can be computed as 

sup z ■ u = sup / u ■ a(t)p(t) dp{t) 
zeQ P GV(x)Jx 

= esssup u ■ a(t) (2-10) 

tex 

= S U , 

by considering p uniform on the set Ef = {t £ X : ess sup u ■ a(s) < u ■ aft) + e} 
and zero elsewhere, then taking e —> 0, which gives the necessary and sufficient 
condition.□ □ 

Using the necessary and sufficient condition given previously it is possible to 
formalise the intuition that if b is close to the boundary of <2, any probability 
distribution p generating b must be concentrated on some subset of X with small 
measure. Firstly, it will be shown that if b £ Q is close to the boundary of Q , then 
there is a certain set, depending on 6, on which any p £ T{X) with J x apdp = b 
must have most of its mass. Then secondly, it will be shown that as b approaches 
the boundary of Q , this set becomes arbitrarily small in measure. These results will 
also be important in providing a growth bound for the singular potential in the 
next section. 

Lemma 2.1. Let b £ Q, bo £ dQ with |6o — b\ = min \b — b\. Define u £ § fc_1 to be 

bedQ 

the unit vector in the direction ofbo — b. Then u is normal to a supporting hyperplane 
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of Q at bo, so that bo ■ u = S u , and d(b,dQ ) = S u — b ■ u. Furthermore, there is 
only one supporting hyperplane of Q at bo, with outward-pointing unit normal vector 
given by u. 


Proof. First it will be shown that u is normal to a supporting hyperplane at bo. 

Assume otherwise for the sake of contradiction. Then there exists some u' £ 8 2 , 

normal to a supporting hyperplane at bo, which must satisfy uf^u!. Let x £ be 

the projection of b onto the hyperplane given by the set of all b £ l fc with b-u' = S u /, 

In particular, x ^ Q. Furthermore, \x — b\ < |&o — b |. By considering the straight 

line from b to x, there must be some y £ dQ such that \y — b\ < \x — b\ < |6 0 — b\, 

contradicting the assumption that \bo ~ b\ = min \b — b\. Then it holds that 

b&dQ 

S u — b ■ u = (bo — b) ■ u 

= Wrvf 0 ~ b) ' (t ° “ b) <2 ' n) 

= \b 0 -b\ = d(b,dQ). 

To show that this is the only such hyperplane, assume that u\ £ § fc_1 , u\ ^ u 
and ui ■ bo = S Ul . By the same argument as before, by projecting b onto the 
supporting hyperplane with normal u\, it contradicts that bo was a closest point on 
the boundary.Q □ 


Proposition 2.5. Let b £ Q, bo £ dQ with 

|b — b Q | = min | b — 6|, (2-12) 

beac 

define u = | bo _u (^o — b) and e 2 = |&o — b\. Let S u , Ef be as in Def. \2-4\ Then 
S u — b ■ u = e 2 and 



p{t) dp(t) > 1 — e 


for all p £ V(X) generating b. 


(2.13) 


Proof. To see that S u — b ■ u = e 2 is a result of the previous lemma, and observing 
that S u — b ■ u = (bo — b) ■ u. Letting E = Ef for brevity, from the assumptions it 
can be seen that for any p £ P(S 2 ) that generates b , 

0 = u ■ b — S u + e 2 



(u ■ aft) — S u + e 2 ) p(t) dp(t) 



S u + e 2 ) p(t) dp(t) 


< e 


1 / pdp+(e 

J E 




(u ■ aft) - S u + e 2 ) pft) dpft) 


(2.14) 
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using that for t £ X, u ■ aft ) — S u < 0, and for t € X \ E that S u — u ■ aft) > e. If 
I = J E pdp for brevity, then this chain of inequalities gives that 

0 < e 2 I + (e 2 — e)(l — I) (2.15) 

so that dividing through by e and rearranging gives the desired result. □ □ 


Proposition 2.6. For u £ § fc 1 and e > 0, let E™ C X be as in Def. \2.J\ Then 
lim n{E™) = 0, where the convergence is uniform in u. 


Proof. That lim /i(E™) = 0 is a consequence of the pseudo-Haar property of the 
6—^0 

constraint functions. Since Ef = fS u — a ■ at) -1 ((—oo,e)), it holds that 


lim p(Ef) = p 
£->o e 



(2.16) 


= p({t £ X : S u — u • aft) = 0}) = 0 


by the pseudo-Haar property. 

To see that the convergence is uniform, assume otherwise so that there exists 
some 7 > 0, and (uj)j^, (ay)y e pj so that the sets Ej = E ^ satisfy p{Ej) > 7 for 
all j, and lim ay = 0. For brevity, denote the functions fjft) = S Uj — Uj ■ aft ) — ay. 

j-HX> 

Up to a subsequence (not relabelled) it can be assumed that Uj —> u* , and since the 
convergence of Uj • aft) —> u* ■ aft ) is uniform, there is a corresponding S* = S u * = 
lim S u .. Let e > 0. Let / = lim fj. Then for sufficiently large j since fj —» / 

j—¥ OO 3 j—> OO 

uniformly, it must hold that if t £ Ej then fft) > — e. This means that Ej C E “ 
for sufficiently large j. Therefore for large j, 

T{Ej) <p(Ef) 

=>limsup fi(Ej) <n(E™ ). $.17) 

j->oo 


Since e was arbitrary and lim(£'“ ) = 0, this leads to the conclusion that 0 < 7 < 

lim sup (ifEj) < 0, a contradiction. □ p 

j-* °° 


3. The Singular Potential 

In order to define the singular potential, there are certain constraints on the ob¬ 
jective function <j> that are necessary to ensure that the minimisation problem is 
well posed and the resulting singular potential has desirable properties. Motivated 
by the Shannon entropy, 4>{x) = a; In a; as the prototypical example, any function 
satisfying these constraints will be called entropy-like. 

Definition 3.1. Let (j> : R. —> (— 00 , 00 ]. Define dom(0) = {a; £ K : 4>{x) < + 00 }. 
Then is entropy-like if and only if int dom(<^) = (0, 00 ), <)>|(o,oo) is strictly convex, 
continuously differentiable, lim = +00 and lim (f>'(x ) = — 00 . 

x—>00 x z —> 0 + 
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In particular any entropy-like function is of Legendre type, so that (j)' is a continu- 
ous bijection between (0, oo) and R 2H. This definition does not place any restriction 
on the limiting value of 0 at zero. It may hold that lim 4>(x) is bounded, such as 

X—> 0 + 

in the case of the Shannon entropy, or the limit may be infinite, such as for the 
example ^ + x 2 . 

Definition 3.2. Let 0 be an entropy-like function. Define the singular potential 
ips : Q -t R by 

i>s(b) = min [ 0(p) dp (3.1) 

P^A h J x 

where the set Ab is defined as 

Ab = jp S V{X) : J p(t)a(t) dp(t) = 6 j . (3.2) 

Using the definition of the singular potential, the next results will be concerned 
with establishing some key properties of the function. 

Proposition 3.1. Let 6 G Q. Then the singular potential, ip s {b) is well defined in 
the sense that the minimisation problem admits solutions. Furthermore, the min¬ 
imiser is unique, and ij) s is a strictly convex function. 


Proof. The minimisation problem is a convex minimisation problem subject to 
linear constraints, with constraint functions that are pseudo-Haar. Together with 
the growth conditions imposed on 0 , this ensures the existence of a unique min¬ 
imising p, with p bounded away from 0 and +oo, so that if s (b) < +oo [7]. To see 
convexity, first since Q is convex there is no problem in taking convex combinations 
of elements of Q. Let bi, b 2 G Q, e € [0,1], If pi, P 2 solve the minimisation problems 
corresponding to 61,62 respectively, then epi + (1 — e)p 2 € ^4 e {, 1 +(i_ e )h 2 . Hence 

4>s (e 6 i + (1 - e) 6 2 ) = min [ 0 (p(t)) dp{t) 
pGXeb 1 + (l- e )b 2 J X 

< [ <t> (epi(t) + (1 - e)p 2 (t)) dp(t) 

Jx (3.3) 

< / e 0 (pi(f)) + (1 ~ e) 0 (P 2 (*)) dp(t) 

Jx 

= e 0 s ( 6 i) + (1 - e) 0 s ( 6 2 ). 

Furthermore, note that the strict convexity of 0 gives that for 61 ^ 62 and e G (0,1), 
Pi 7 ^ P2 on a set of positive measure, giving that this inequality is strict. □ □ 


Theorem 3.1. There exists some c > 0 such that for all 6 G Q, bo G dQ with 

min 16 — 6| = |6o — 6| = e 2 and e < c, the singular potential satisfies the inequality 
bedQ 


Mb) > KB?)* 


1 - e 

PiW) 


p ( X \£“)0 


»(X\E«) 


(3.4) 
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where Ef is as defined in Def. \2.f\ and u = i feo _ fc i (bo — b). 


Proof. Let b £ Q, with corresponding bo, e, u. Let E = Ef for brevity. From Prop. 
2.5l it is known that J E pdp > 1—e for all p with J x pa dp = b. In particular by taking 
p £ Ab to be the unique minimiser given in Prop. 13.11 so that if s (b) = f x <p(p) dp, 
then 


^s(b)= / f(p)dp+ / <t>(p)dp 
JE Jx\E 

> p(E)<j) J E pdp^j + p(x \ E)<t> 


1 


(3.5) 


p(X \ E) J X \E 

1 —e 


pdp 


by applying Jensen’s inequality. Since f E pdp > jjjffj-, using the uniform con¬ 
vergence of p(E) —> 0 in u from Prop. 12.61 and that cf> is an increasing function for 
sufficiently large argument, it must hold that there exists some ci > 0 so that if 
e < Ci then 


1 


p(E) 


pdp) > 


’ E 


C(£))' 


(3.6) 


Similarly for e < C 2 since (j) is decreasing for sufficiently small argument, it must 
hold that 


1 


p(X \ E) J x \e 


pdp) > 


( P(X\E ) 


Combining these inequalities gives the lower bound.□ 


(3.7) 

□ 


Corollary 3.1. For j £ N let bj £ Q with lim inf \b—bj\ = 0. Then lim ip s (bj) = 

j-*°o b£dQ 

+oo. Furthermore the convergence is uniform in inf \b—bj\, so that 

b£dQ 


lim (inf {i/j s (b) : d(b,dQ) < e}) = + 00 . 

e—>-0 


(3.8) 


Proof. Let b £ Q, and Ef = E for brevity. Using the lower bound for ip s , and that 
ft is bounded below, for some real constant C, independent of b, 


ips(b) > p(E)4> 


~— 

KE)J 


+ C = (l-e)^p-+C, 


(3.9) 


where ^ Using that p(E) —> 0 uniformly as b —>■ dQ and the superlinear 

growth of <f> gives the required result. □ □ 


The precise form of the minimiser can be obtained by considering the dual 
optimisation problem. For a more thorough account of duality the reader is referred 
to Ref. [2B] although for completeness a brief heuristic argument will be provided 
here. 
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Let X be some Banach space, F : X — > M be convex, and T : X —> R fc be 
continuous and linear. Let b £ R fc and consider the minimisation problem 


min F(x) 
xex 

subject to Tx = b. 


(3.10) 


Trivially, if x does not satisfy Tx = b, then max (—A • (Tx — b)) = +oo, and if 

AeR fc 

Tx = b, then (—A • (Tx — b)) = 0 for all A £ R fc . Therefore the minimisation 
problem can be written as 


min max F(x) — A • (Tx — b). (3.11) 

xex AeR k 

Let X* be the dual space to X with duality pairing T* : R fc — > X* be 
the adjoint of T, and F* : X* —> R be the convex conjugate of F defined by 
F*(x *) = sup (a;*, a;} — F(x). Assuming that the minimum and maximum in (13.111) 

xEX 

can be exchanged, this minimisation problem can be written as 

max min F(x) — A • (Tx — b) = max A • b — max(T*A, x) — F(x) 

AeR fe xex AeR fc \xex' 

= max A - b- F*(T*X). 

AeR fc 

If the interchange of minimum and maximum is permitted, then the dual problem 

max A ■ b~ F*(T*X) (3.13) 

AeRfc 

can provide information about the original so-called primal problem. Furthermore, 
since the minimisation is over a finite dimensional set without constraints, it may 
be easier to approach. This is what is done in the work of Borwein and Lewis and 
their results will be exploited for the sake of this work. In particular the minimiser 
of the primal problem can be described in terms of maximiser of the dual problem. 



Proposition 3.2. Let <f> : R —> (— 00 , 00 ] be an entropy-like function. Then for 
b £ Q, the unique minimiser of 


is achieved by 


min / 4>(p) dp 
pEAb J X 


Pb(t ) = (cf) 1 (a + A • a(t )), 


(3.14) 


(3.15) 


where a £ R, A £ R fc are the unique maximisers of the dual optimisation problem, 


max a + \ ■ b — J cf* (a + A • aft )^ dp. (3.16) 


Proof. This is a straightforward application of Ref. 0 ■ 


□ 
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The next stage is to establish smoothness properties of the singular potential. 
In order to show differentiability of ip s , the key point is to observe that the map 
from the Lagrange multipliers to the moments b is explicitly given and invertible. As 
such the differentiability of ip s is inherited from the explicit map from the Lagrange 
multipliers to Q. 

Proposition 3.3. Let ■ R —> (— 00 , 00 ] be entropy-like, C m on (0, 00 ) for m > 2, 
with (/>" positive everywhere. Then the map b 1 —x (a, A) is C m_1 , where a, A solve 
the dual optimisation problem. 

Proof. For (a, A) £ I x consider the map 

h : (a, A) i-X f (1, a(t)) (</>') 1 (a + A • a(f)) dpi[t) = (/3, b). (3.17) 

Jx 

Since (j> " is positive, the inverse function theorem can be applied to (eft) 1 , and 
the differentiability of gives that (is C m_1 on R. In particular, this implies 
that h € C m ~ 1 (R fc+ 1 ,R fc+1 ). The Jacobian matrix of h is given by 

= ® ( 1,a (*)) ° ( a + A ' a (*))) d ^' ^ 3 ’ 18 ^ 

Here o denotes function composition. If this Jacobian matrix failed to be invertible, 
there would exist some £ € R fc+1 with £ 7 ^ 0 so that 

0 = / ((1 ,a(t)) ■ £) 2 ($" o (fiy 1 (ot +a(t))^) dp(t). (3.19) 

Since <f>" is positive everywhere, this implies that (£ • (l,a (<))) 2 = 0 almost every¬ 
where in X ; however by the pseudo-Haar property this cannot hold, a contradiction. 
By applying the inverse function theorem to h , this gives that the map 

(0,b)->(a,A) (3.20) 

is C " 1-1 for (/?,&) G h(R k+1 ). In particular, h~ 1 ( 1,6), which maps b S Q to (a, A) 
is also a C m_1 function. Global invertibility is not problematic, since the existence 
and uniqueness of solutions for dual problem ensures the existence and unique pair 
(a, A) for each b € Q, and non-existence otherwise-^ □ 

Proposition 3.4. Let <f> be a C 2 entropy-like function with <f>" positive. Then for 
b € Q, ^jjf(b) = A, where A solves the dual optimisation problem corresponding to 

b. 

Proof. From Ref. [7 it is possible to write the singular potential in terms of the 
Lagrangian dual optimisation problem, so that for dual optimal a, A, the singular 
potential can be written as 

ifs(b) = a + A • b — f 4>*(a + A • a(t)) dyi(t) (3.21) 

Jx 
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Using from Prop. RO that the map b i-» (a, A) is at least C 1 , differentiating through 
gives that 

dip s da d\ f , i f da dX \ 

~db = m + db' b + x ~J x ^ ) {a + x ' a{t)) \db + db' a{t) ) Mt) 

= X + % ( X " Jx Pb ® d(j '^) + ■ ( 6 “ Jx dp< ^>) ( ' 3 ' 22 ' 1 


= A. 


□ 


□ 


Corollary 3.2. If <p is entropy-like, C m for m > 2 on (0,oo) and <f>" is positive, 
then ip s G C m (Q,R). 


Proof. Combining the results of Prop. EP1 and [3~4l it holds that 


dips 

db 


= X G C m ~ 1 (Q; K fc ), 


(3.23) 


so that ip s G C m (Q,R).Q 


□ 


Proposition 3.5. Assume <p is entropy-like, C m for m > 2 on the interior of 
its domain and that cp" is positive. Let bo G Q, and denote p = pb 0 , a = , 

Z = f x a dp. and d = f?<j. Then 

— (6, o) = — f [ a®addp — [ addp 0 [ ad dp\ . (3.24) 

db Z \J X Jx Jx > 

In particular, if <p is the Shannon entropy <p(x) = xlnx, then 

^(M = {^J a®apdp-b 0 ®bjjj . (3.25) 

Proof. Under the standing assumptions a, A as functions of b are at least C 1 . 
Following a similar argument to Prop. 13.31 

1 = f (<//) 1 (a + X ■ a) dp 
Jx 


f J da 

d X s 

j a dp 

Jx \dbi 

+ £%“ 

„ da 

/ aa dp- 

lx 

dX 

= z aC. 

dW 


(3.26) 
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Similarly, considering the relation between a, A and b gives 


bi= ai ( 4>') 1 (a + A ■ a) dp, 

Jx 

f f da <9A \ 

On = / [ at—+ di—■ a ) ad/j 


lx 


1 /' , f . d\ f dX 

= — — / ai<T dp / aa dp ■ ——h / a,;acr dp ■ —— 


(3.27) 


lx 


ix 


dbj 


dbi 


=>I = Z 


a ® ad d^L — / ad dfi ® / ad dfi ) —. 

'x Jx Jx J ob 


Since </>", and therefore cr is positive, and cr = j ] rd ^ o', this gives that d £ V(X), 
and furthermore due to the bounds on p, d is bounded away from zero. Therefore 
by use of the pseudo-Haar condition analogously to in Prop. 13.31 the matrix 


Z 


a® ad d^L 


’x 


ad dn (g> 


lx 



(3.28) 


is invertible, so that taking its inverse gives the result. To see the case when <j> is 
the Shannon entropy follows immediately since 4>"{x ) = which gives a = p, so 
Z = 1 and d = p.Q □ 


Proposition 3.6. Let (j) : R — > (—oo, oo] be an entropy-like function, C 2 with cf" 
positive on (0,oo). Then the map F : Q —> V{X) given by F(b) = pb is continuous 
with respect to the L°° topology on V{X). 


Proof. From the argument in Prop. 13.31 it is sufficient to show that the map G : 
(a, A) i->- (<//) _ (a + A • a) is continuous as a function from R fe+1 to L°°. That 
G(a, A) £ L°° is a consequence of (f)^ 1 being continuous and the inequality | a + 
A-a(t)| < |a| + |A| ||a||oo- To show continuity, fix ao , Ao- Define K = |ao| + |Ao| ||a||oo- 
Consider a, A with \a — ao| + |A — Aq| ||a||oo < e for e > 0. Then 


\a + A • a(f)| = \a - a 0 + (A - A 0 ) • a(t) + a 0 + A 0 • a(t)\ 
< e + K. 


(3.29) 


Since (</>') 1 is C 1 on R, it is Lipschitz on [— (Jv +e),K + e] with Lipschitz constant 
L, say. Hence 


{(/>') 1 (a + A ■ a(t)) - (ft) 1 (a 0 + A 0 • (a(t)) < L\a - a 0 + (A - A 0 ) • a(t)| 

< L (|a - a 0 | + |A - A 0 | Halloo) 


< Le. 


(3.30) 


□ 


□ 
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4. Applications 

For the remainder of this section, </> will be assumed to be entropy-like and C 2 with 
positive second derivative on (0,oo). In particular from Corollary 13.21 this implies 
ips is C 2 . Given b e Q, p b will be as in Proposition 13.21 and given p £ V{X), 
define b p = J x apdp. Within this section all the examples of constraint functions 
can be identified with linearly independent sets of analytic functions on connected 
subsets of R”, with corresponding measure absolutely continuous with respect to 
the Lebesgue measure, so that by^, the pseudo-Haar condition is satisfied. 


4.1. Mean Field and Nonlinear Constraint Models 


The aim of this section is to establish some elementary results on minimisation 
problems for certain functionals on F(X) by reducing the problem to a finite di¬ 
mensional problem via the machinery of the singular potential. The key point is 
to establish equivalence between global minima, local minima and critical points 
of functionals on V(X) and global minima, local minima and critical points (re¬ 
spectively) of functions on Q. Once this has been shown it is possible to establish 
standard results such as existence of minimisers by using simple tools of real analy¬ 
sis rather than having to resort to more complicated arguments based on functional 
analysis. Critical points of the functionals on P(X) will be expressed as implicit 
relations on the dual variables. This loosely provides a framework for dealing with 
such minimisation problems, whereby the model is derived as a problem in V(X) 
by some physical argument, then the singular potential allows analytical questions 
to be asked in the simpler language of real analysis as a problem in Q , and then 
finally the dual variables provide a framework that is attractive from the point of 
view of numerical analysis. 

Let 0 /Ac Q be relatively closed in Q. Let / : Q — > R be continuous on A and 
bounded from below, with / |q\a = +oo. Define the functionals Xp : V{X) —> R 
and Zq : Q — > R by 

I v{p)= [ (t>{p)dp + f(b p ), 

Jx (4.1) 

Z Q (b) = ips(b) + f(b) = l v {pb)- 
Consider the minimisation problems (PI) and (P2) given by 


(PI) min X-p(p) 

PW) (42) 

(P2) minln(fr). 
v ' bee 

The aim of this section is to show equivalence of global minima, local minima and 
critical points for (PI) and (P2) and characterise such points. For applications 
(PI) will be obtained through a physical argument, and (P2) loosely speaking is a 
simpler, macroscopic, equivalent model. The motivation for the problem (PI) comes 
from two different areas. Firstly one can see maximum entropy methods subject to 
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nonlinear constraints as an example. Let g : Q —> R be a continuous function. 
Consider the minimisation problem 

min / d>(p) du 

pgv(x) J x ( 4 . 3 ) 

subject to g(b p ) = 0 

Define f(b) = 0 if g(b) = 0 and f(b) = +oo if g(b) ^ 0 so that A = {b G Q : 
g(b) = 0}, which by the continuity of g is relatively closed. Then the minimisation 
problem 

/ <k(p)dfi +f{b p ) 

P&P(X) Jx 

is equivalent to the nonlinear constraint model. Examples from the literature of such 
models are seen in statistical models of isotropic elasticity ^3, where X = § 2 , a t (p) = 
Pi (in Cartesian coordinates) for p G § 2 represents the orientation of molecules in 
a polymer chain and b corresponds to the end to end vector spanned by the entire 
chain. The constraint in this case is of the form \b\ = r for r G [0,1). Maximum 
entr opy methods with to nonlinear constraints were also considered by Decarreau 
et.all^ as a method for dealing with the phase problem in crystallography. In 
diffraction experiments it is much simpler to observe the intensity of a wave than 
its phase, and mathematically this corresponds to having knowledge of the modulus 
of a Fourier coefficient representing the wave whilst its argument is unknown. For 
example, Decarreau et al. consider minimisation problems such as 

{ min f n <b(p(x)) dx 
P er( n) JU 

2 2 ? 

(J n cos(n • x)p(x) dx ) + (f Q sin (n ■ x)p[x ) dx) = m^ Vn G N 

where fl = [0, 27r], N is some finite subset of N and m n are given real numbers. 

A second example of problems of the form (PI) are given by the mean field 
approximation. Given some state space X and constraint functions (a,)^ =1 , the free 
energy is typically of the form 

T f 4>(p) dp - ^Kbp -b p - H -b p (4.4) 

J X “ 

where T > 0 represents temperature, K is a positive definite k x k matrix that 
represents some kind of interaction potential, and H G is representative of some 
kind of external influence such as magnetic/electric fields. In practice the Shannon 
entropy <j>(x) = x\nx is almost exclusively used, although for this analysis this is 
not necessary. 

In the literature these minimisation problems often are not dealt with rigorously, 
and in particular the issue of non-differentiability of the objective function </> at 0 
is often neglected, so that the given solution is obtained by considering only points 
where the first variation of the free energy is zero. The following results will aim 
to show that under rather non-restrictive assumptions the minimisation problems 
are well posed and the solutions behave as one would expect with a non-rigorous 
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analysis. Through use of the singular potential the minimisation problem can be 
reduced to an analytically simpler but equivalent problem (P2) in finite dimensions, 
for which one can easily obtain the desired results. 

Proposition 4.1. There exists global minimisers b* and p* of Iq and Ip respec¬ 
tively, and all global minimisers o/Iq are in one-to-one correspondence with global 
minimisers ofl-p via the map b i—>■ py. In particular, global minimisers of I-p are 
bounded away from zero and infinity. 


Proof. From the assumptions on <j), f, it is immediate that these functionals are 
bounded below. Furthermore, their infima coincide, since 


inf I-p(p) = inf ( inf Ip ip) ) 
per(x) beQ \p&A b J 

= inf Iq{V). 

bee 


(4.5) 


Since Q is a precompact set, Iq is continuous on the relatively closed set A, and 
lim Ze (b) = +oo, it must hold that a minimum exists for 2 S , with corresponding 

minimiser b*. Furthermore, since Ip(pb*) = Iq{Id*), the minimum of Ip must be 
attained at pb*. Finally, if p* is a global minimiser for Ip, then it must hold that 
p* = Pb * for b* = b p *, since otherwise Ip(p*) > 2g(6*) = Ip(pb-). □ □ 


Proposition 4.2. There is a one-to-one correspondence between local minimisers 
oflQ and L 1 -local minimisers of Ip, given by the map b h+ pb, so that p* £ V{X) 
is an L 1 -local minimiser if and only ifb p * £ Q is a local minimiser, and vice versa. 
In particular, all local minimisers of Ip are bounded away from zero and infinity. 
The equivalence also holds for strict local minimisers. 


Proof. The proof for strict local minimisers is identical to that for non-strict min¬ 
imisers, so the proof for strict local minimisers will be omitted. Let b* £ Q be a 
local minimiser, so that 2g(6*) < 2g(6) for all b £ Q with \b — b*\ < e. Define 
p* — Pb *• Let p £ V(X) with || p — p*||i < 5. Then if b = b p , |6 — b*\ < ^IjaHoo. 
Finally, 

I v {p)-Ip{p*)= [ cf(p)dp + f(b)-I Q (b*) 

Jx 

>Mb)+m-i a (b*) (4 - 6) 

=i Q {b)~i Q {b*). 

Therefore if S < —, it holds that \b — b*\ < e so that consequently Ip(p) > 

Zp{x){p*)- 

To show the converse statement, note that if p* is an LMocal minimiser of Ip, 
then p* = p^. This can be seen by considering p = (1 — 7 )p* + 7 pb * for 7 > 0 small, 
and using the strict convexity of <f>, which gives that Ip(x){p ) < Zp(x)(p*)- 
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Now let p* be an L 1 -local minimiser, so that for all p £ 'P(X) with 
||p — p*11 1 < e, Iv(p) > %v(p*)- Now consider b £ Q. Then 

r Q {b) - I a (b *) = i v (p b ) - i v (p*). (4.7) 

All that remains to show is that there exists some <5 > 0 so that for all b £ Q with 
\b — b*\ < 5, then ||p b — p*11! < e. This holds, since from Prop. 1531 it is known that 
the map 6 i—>■ pt, is continuous in L°° , and hence is continuous in L 1 .□ □ 

For the following result, a critical point of a function F : V(X) —> R is defined 
to be any p in V{X) that is bounded away from zero, such that for all £ £ L°°(X) 
with j x ^dp = 0, 

= 0. (4.8) 

T =0 


—F( P + t£) 


Proposition 4.3. Let int(A) ^ 0 and let f be C 1 on int(A). Then b* £ int(A) is a 
critical point o/Iq if and only if p* = p b * is a critical point ofl-p in {p £ V(X) : 
b p £ int(A)}. 

Proof. Since b* is a critical point of the C 1 function Zg, it holds that 

0 = Vlotb*) 

’ (4.9) 

= A(6*) + V/(6*). 

This implies that c/)'(p*) = a-Xf(b*)-a. Consider any £ £ L°°(X ) with f x £dp = 0. 
This gives 

= [ (J(p*)-Xf(b*)-a)£dp 
J x 

= a fa„ < 41 °) 

Jx 

= 0. 

The converse follows by the same argument .□ □ 

Remark 4.1. Whilst Proposition 14.21 shows that there is a one-to-one correspon¬ 
dence between local minima of I-p and Zg, an analogous result does not hold for 
local maxima. Let b £ Q be a local maximum for Zg, and let £ £ L°°(X) \ {0} be 

such that f x £ dp = 0 and f x at; dp = 0. Then p b + rt; £ A b for r£l sufficiently 

small to ensure the non-negativity constraint is satisfied, and hence 

Iv{p b + t £) = [ (/>(p b + T^) + f(b) 

Js , 2 


fpW+rQ 


> ips(b ) +f{b) 

= Ia(b) 

= Zv(Pb)- 


(4.11) 
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By taking r sufficiently small shows that pb is not a local maximum, although if 
/ £ C 1 , pb will be a critical point by Prop. 14.31 

Corollary 4.1. Consider the minimisation problem (PI). If A = Q and f £ 
C 1 (Q;K) then there exists a global minimum, and all L 1 -local minima and crit¬ 
ical points satisfy 

p*{t) = { ( j>')- 1 {a-Vf{b p )- a {t)) (4.12) 

for some constant a€l. 

Proof. This is a straightforward application of Propositions 14. 11 14.21 and I4.3l n □ 

Corollary 4.2. Consider the minimisation problem (PI). Let g £ C 1 (Q 1 R) and 
A = {b £ Q : g(b) = 0} 7^ 0. Then there exists a global minimum, and all L 1 -local 
minima and critical points satisfy 

P*(t) = {(f'Y 1 (a + pXgfb) ■ a(t )) (4.13) 

for all t £ X and some Lagrange multiplier r) £ R. 

Proof. Again this is a straightforward application of Propositions l4.1ll4.2l and l4~3l n 


Although the equivalent problem (P2) is simpler by virtue of being finite¬ 
dimensional, the non-explicit representation of the energy limits its usefulness. How¬ 
ever, an order parameter that is in some sense dual to b £ Q can be considered, 
and reduces the dual optimisation scheme needed at each point in the domain to a 
simple 1-dimensional problem. First, a lemma is required. 


Lemma 4.1. Let A £ R fc . Then there exists a unique a\ £ R such that 
lx (ft) ( a A + ^ ' a (I)) dp{t) = 1) which can be found by solving 


max a — 

a£l 



4>*{ol + A • a(t)) dp,(t). 


(4.14) 


Furthermore, if is C m on its effective domain for m > 2 then a is a C m 1 
function of X. 


Proof. Existence of such an a\ can be seen by recalling that (</>') 1 is an increasing 
bijection between R and (0,oo) and the map a 1 — > f x ( <f>')~ (a + A • a(t))dp(t) is 
a continuous function of a by applying the intermediate value theorem. Unique¬ 
ness follows from the strict monotonicity of (<//) - . The regularity follows from the 
implicit function theorem, using an argument analogous to Proposition l3.3l The ex¬ 
istence of a maximiser for the problem in Equation (14.141) follows from the existence 
of a unique critical point and the strict concavity of the functional. □ 
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Remark 4.2. In the case of the Shannon entropy, the additive property of the 
exponential greatly simplifies the previous lemma, since cz\ can explicitly be given 
as 

a\ = 1 — In exp (A • a(t )) dp(t)^ . (4-15) 

We now define A £ R k to be the dual order parameter. It corresponds to the 
classical order parameter through the relationship 

b = ( ((/)') 1 {a\ + \ ■ a(t)) a(t) dp(t). (4.16) 

Jx 

If (f> is C m for m > 2, then it follows that the relationship between the dual and 
classical order parameters is a C m_1 bijection between R fc and Q. Furthermore, if 
b x € Q is the classical order parameter corresponding to a dual order parameter A, 
then it is immediate that 

X Q (b x ) = ax + A • b x + f(b x ) = l Rk (A) (4.17) 

The map A H > b x is a continuous open map, which implies that local minimisers of 
X R k and Xq are in one-to-one correspondence also. The advantage of solving I R k is 
that the domain is no longer a constrained set, and at each point one must only 
solve a 1-dimensional optimisation problem rather than a (k + l)-dimensional one. 
In the case of the Shannon entropy, the explicit expression of as mentioned in 
Remark 14.21 requires no optimisation problem to be solved in order to evaluate the 
energy. In the case of the Onsager energy, the following proposition explicitly states 
these conclusions. 


Proposition 4.4. Let A £ R fc . Define 

Z\ = [ exp (A • a(t)) dn(t), 
Jx 


b x = 


Zx 


exp(A • a(t))a(t) djj,{t). 


(4.18) 


’X 


Then all global minimisers, L 1 -local minimisers and critical points (respectively) of 
Onsager’s free energy 


Zv(x) (p) = T f Pit) In pit) dp{t) - i I\b p ■ b p 

a X ~~ 


(4.19) 


are in one-to-one correspondence with global minimisers, local minimisers and crit¬ 
ical points (respectively) of the function 

1 


T{X-b x - HZ X )) - ^Kb x ■ b x 


(4.20) 


by the relation 


P= exp(A • a(t)). 


(4.21) 
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Proof. This follows from the preceding discussion. □ 

Finally using the machinery of the singular potential it is possible to obtain 
some simple estimates on the stability of the isotropic phase p{t) = -jppx] in mean 
field free energy models. In the following <j> will be taken as the Shannon entropy 
4>(x) = x In x, and the constraint functions are taken to be orthonormal in L 2 (X). 
Define the mean field free energy on V{X) for T > 0 and a positive definite k x k 
matrix K as 

I'p(p)=T I pin pdp — —Kb ■ b (4.22) 

J x 2 


Proposition 4.5. If T > ||a||^ 0 A ma x(A') then the isotropic state is globally stable 
and there are no other critical points of Ip. IfT > ^ X ) A max (ii) then the isotropic 
state is at least locally stable. If T < ^^yA max (A') then the isotropic state is un¬ 
stable. 

Proof. Consider the corresponding free energy functional Iq : Q —> R defined by 

l Q (b) = TMb) ~ \Kb ■ b 

Note first that the isotropic state b = 0 is always a critical point for Iq. The Hessian 
matrix of Iq at b is given by 

n ( b ) = ^Qp (fr) = T a 0 apb — b<8>b^ - K. (4.23) 

The Holder inequality and that b ® b is positive semi-definite gives 

/ a ® apb — b ® b < / a (g) apt < HaH^,/ (4.24) 

Jx Jx 

so that H(b ) > r j^-I — K. This gives that for T > ||a||^ 0 A max (/v) that H{b) is 
positive definite for all b € Q so that Iq is a strictly convex function on Q, so that 
there exists at most one critical point. This implies that the isotropic state must 
be the global minimum and only critical point. Applying Proposition [4J] gives the 
corresponding result for I-p. Due the the normalisation of the functions (aj)^ =1 it 
holds that H{ 0) = Tp(X)I — K. Consequently if T > ^tWA max (^f) then H( 0) is 
positive definite so that b = 0 is a local minimum. Conversely, if T < ^Y)A max (A') 
then H{ 0) has a negative eigenvalue so that b = 0 is unstable. Applying Proposition 
14.21 gives the desired result for the functional I-p .□ □ 

Remark 4.3. If L°°-local minimisers are a priori known to be bounded away from 
zero and +oo, then L°° variations can obtain a critical point condition for L°°- 
local minimisers analogous to Corollaries 14.11 and 14.21 Similarly, taking the second 
variation can provide estimates analogous to Proposition 14.51 to assess the stability 
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under L°° perturbations. The advantages of the methods presented in this work are 
twofold; firstly that all statements are in reference to LMocal minimisers, a stronger 
condition which could not be obtained by two-sided variations since V(X) has empty 
relative interior in L}(X), and secondly the technicality that local mininrisers must 
be shown to be bounded away from zero and +oo is removed. 

Remark 4.4. More general mean-field like free energies can be defined by 



Jx Jx Jx 


with K € L°°(X x X) a symmetric kernel. If, for example, X is a connected, ana¬ 
lytic, bounded, Riemannian manifold then the eigenvectors of the Laplace-Beltrami 
operator on A', denoted (aA^i, form a countable, dense, orthogonal basis for L 2 (A) 
with cii analytic for i £ N - . By Prop. [23] they must therefore form a pseudo-Haar 
set. This allows the decomposition of K as 


OO 



(4.26) 


so that by truncating this series to finitely many terms the kernel can be approxi¬ 
mated, and the approximating model fits into the framework presented in this paper. 
In many models related to liquid crystals this decomposition would be superfluous, 
as the kernel is assumed to have a decomposition of the form in (14.261) with Cij 
non-zero for only finitely many i,j. For example, if V\(X) denotes the eigenspace 
of the Laplace-Beltrami operator on X with corresponding eigenvalue A, then the 
constraint functions corresponding to the Maier-Saupe potential (dipolar potential, 
respectively) for nematic liquid crystals are in P_ 6 (§ 2 ) (P_ 2 (§ 2 ), respectively) 
the McMillan model (see Subsubsection 14.3.21) has two constraint functions, one 
in P_ 6 (S 2 x [0,1]) and another in V^tt (§ 2 x [0,1]), and the Strayley modeluses 
elements of VLg(SO(3)). 

Remark 4.5. Propositions 14. 11 14. 21 and 14. 31 presented together imply that globally 
stable, locally stable and unstable equilibrium points of the functionals on V(X) 
can be found by considering a macroscopic functional defined only on < 2 , combined 
with the maximum entropy assumption. 

4.2. Models with Spatial Inhomogeneities 

If, as in the Q-tensor theory of liquid crystals, one wishes to consider models with 
inhomogeneities in a domain fl C M n , then one could appeal to the calculus of 
variations and consider functions b : —> Q (note that the closure of Q is taken for 
compactness properties), and investigate minimisers of the functional 


1(b) = W(X7b(x),b(x)) + F(b(x)) dx 



(4.27) 
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in some appropriate function space, where W : R tx " x Q 4 I represents the 
energy of distortions, and F : Q —> R represents the free energy of a homogeneous 
system. However, if F were to be bounded on dQ , as is the case for a Landau 
expansion, then it is possible that minimisers could satisfy b(x ) £ dQ for all x in 
some set of positive measure, providing unphysical solutions. Furthermore this is 
undesirable from a mathematical perspective since the minimiser would not satisfy 
the Euler-Lagrange equation. However, if one considers a functional X B , given by 

l s (b) = [ W(Vb(x),b(x)) + i/> B (b(x)) + f(b(x)) dx (4.28) 

J n 

for some /, W bounded away from — oo, then provided finite energy configurations 
exist, it is immediate that minimisers satisfy b(x) £ Q for almost every a: £ 11 . 
Furthermore, in particular simple models it is possible to show strict physicality, in 
the sense that there exists some compact set K C Q for which any global minimiser b 
satisfies b € K almost everywhere, which has the consequence that global minimisers 
satisfy the Euler-Lagrange equation allowing PDE methods to be applied to the 
problem. This is shown by a projection method based similar to that in Ref. jU 

Lemma 4.2. Let g : Q —> R be Lipschitz continuous with Lipschitz constant L. 
Given M £ R define the set Km = {b £ Q : fi> s (b) < M}, and the nearest point 
projection Pm '■ Q —> Km- Furthermore given b £ Q denote bM = Pm 6 . Then there 
exists Mq £ R so that for all M > Mq and b £ Q, 

4>s(b) + g(b) > ip s (b M ) + g(b M )- (4.29) 

Proof. First note that the projection is well defined, since the convexity and con¬ 
tinuity of ip s ensure that Km is a closed convex set. Suppose that the result is false. 
Then there must exist sequences M k £ R, bk £ Q so that lim M k = +oo, and 

k—± oo 

defining Ck = Pm k bk , for each k the inequality 

V’s(cfc) + g(ck) > i’sifik) + g{bk) 

is satisfied. By taking a subsequence if necessary (not relabelled), assume that 
(M fc )fe g pj is an increasing sequence. Using the fact that g is Lipschitz and elementary 
inequalities for convex functions, it holds that 

L\ck - b k | > g(c k ) - gifik) 

> ip s (b k ) - ip s (c k ) (4.30) 

> V^s(cfe) • (bk - c k ). 

From the construction of Km, OKm is a level set of ij) a , so that Vip s (ck) is normal to 
the surface dK M * at Ck- Also from the properties of projections it holds that bk — Ck 
is normal to dK M k at Ck, with the same sign as V^ s (cfe) in the sense that 'S7ip s (ck) = 
s(b k - c k ) for some s > 0. This implies V^ s (c fe ) • ( b k - c k ) = |V^ s (c fc )| \b k - c k |. 








26 Jamie M. Taylor 


Combining this with the previous chain of inequalities gives 

L\c k -b k \ > |V^(c fc )||cfc-6fc| 

(4.31) 

=> L > |V-0s(cfc)| 

for all k. However since Q is bounded, a subsequence (not relabelled) can be taken 

so that Cfc — > c*, and since c*, £ dK M k for all k, this gives that c* € dQ. Since 

lim tp s {b) = +oo, and 'i/’s is convex, this also implies that lim |V'0 S (&)| = +oo. 
b—>dQ b^dQ 

Taking the limit as k —> oo of the inequality L > |V^ s (cfc)| gives a contradiction, 
completing the proof.D □ 


Theorem 4.1. Let Q C R™. Consider the minimisation problem 

min f \\/b(x)\ p + if) s (b(x)) + g(b(x)) dx, (4.32) 

b ^Jn 

where A = {b £ W 1,p (f2; Q) : b\gn = &o} 7^ 0, g '■ Q —> R is Lipschitz continuous 
with Lipschitz constant L, p > 1 and there exists some compact K C Q such that 
bo(x ) E X for r H n ~ 1 -almost every x € dfl. Then there exists a global minimiser, and 
all global minimisers are strictly physical in the sense that ifs{b) £ L°°(fl,R). In 
particular, any global minimiser b* : LI —> Q satisfies the Euler-Lagrange equation 
for the energy functional if g £ C 1 (Q). 


Proof. The existence of minimisers follows from a standard direct method argu¬ 
ment 11. To see that the global minimiser must be strictly physical, assume for the 
sake of contradiction that there exists a global minimiser b £ A such that fi s ib) 
is unbounded on Ll. Let M > Mq (as defined in Lemma m and be the pro¬ 
jection of b onto the set Km as defined in Lemma 14.21 Take M sufficiently large 
so that K C Km, so that satisfies the same boundary conditions as b. Then 
if s {b M ) + g(b M ) < ip B {b) + g{b) almost everywhere in by Lemma IT/21 Further¬ 
more, since the term on the left is bounded, and by assumption the term on the 
right is unbounded this equality is strict on a set of positive measure. Finally since 
|VP6| < j V6| for any function in W 1,p and projection P onto a convex set (see 
Lemma 10 in Ref. |4j) , it holds that 


/ \Vb M (x)\ p + i/j s (b M (x)) +g(b M (x))dx < / \Wb(x)\ p + i/j s (b(x)) + g(b(x)) dx, 

J JQ 

(4.33) 

contradicting that b is a global minimiser. Since the minimiser is bounded away 
from dQ it is possible to take smooth variations so that the minimiser satisfies the 
Euler-Lagrange equation. □ □ 


4.3. Approximation of the Singular Potential by Everywhere 
Defined Functions 

In some situations it may be preferable to approximate the singular potential by a 
globally defined function. In particular, as mentioned in Subsection 14.21 in models 
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with inhomogeneities the blow up property can pose issues with minimisers satisfy¬ 
ing the Euler-Lagrange equation. One tool kit for providing such an approximation 
is to use the Landau theory 1^, which argues that the free energy of a system must 
be analytic in the order parameters, therefore it is possible to replace the free en¬ 
ergy with a polynomial in the order parameters that respects the symmetry of the 
system. Typically, this polynomial will be taken to fourth order. However, as will 
be shown in this section, the Landau expansion is not necessarily compatible with 
free energies of the form given in Subsection 14.11 , since the Taylor approximation 
does not necessarily preserve shape properties possessed by the singular potential. 
This section will only be concerned with the Shannon entropy, due to the relative 
simplicity of its Taylor approximation as well as its physical relevance. 

4.3.1. One Dimensional Examples and Counterexamples 

The fourth order Taylor approximation about zero of the singular potential is de¬ 
rived in Appendix |Appendix A| For simplicity consider only a single constraint 
function, and state space X = [—1,1]. Define nti = ~ J x a(x) 1 dx. The the fourth 
order Taylor approximation : R —> R. is given by 


A 



If the Taylor approximation is to represent a kind of macroscopic version of entropy, 
and approximate the singular potential, then at the very least it should be convex, 
posses a single local minimum, and be coercive so that the approximation blows 
up to Too as b —> Too. Coercivity perhaps the most important property, since if 
it fails then energy minimisation is not possible. The second condition prevents 
the appearance of “phantom” minimisers for the entropy which should not exist. 
Finally convexity is a useful property in minimisation problems that one would want 
to inherit from the singular potential. In the one dimensional case presented, these 
three conditions have a chain of implication, so that convexity implies the existence 
of a single critical point, and a single critical point implies coercivity. Note that 
these implications rely on the fact that ipf is convex in some neighbourhood of the 
origin, and that it is not linear. Similarly since is a polynomial that is not affine, 
convexity implies strict convexity. 

Fortunately in the one dimensional case, verifying if these properties hold for a 
given example is relatively straightforward. In order to be coercive, the coefficient 
of 6 4 must be positive, which gives the condition 


d\ = 2>m\ — TO27TI4 — 3r?7,2 > 0. 

If b ^ 0 is a critical point of , this implies the existence of a non-zero real solution 
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to 


lt {V) = b {J- S5"* 3 '’ + 6^4 < 3m ’-"* 2m ‘ + 3m >) b j = “■ 

By considering the discriminant of the quadratic, this means that i/>f possesses no 
non-trivial critical points if and only if 

d,2 = 72m2 — 24m 2 TO4 + 63 m\ > 0. 

Finally for convexity, the second derivative of ipf is a quadratic in b given by 

1 m 3 , 1 


db 2 


(b) =- 1 b + -—5 (3ml — ra 2 ra 4 + 37713 ) 6 2 , 


777-2 


77i: 


so by the same argument ^ is convex if and only if 

d 3 = 6 TO 2 — 2?7l2??T4 + 57743 > 0 - 

Within the chain of implication that 

Convexity => A single critical point => Coercivity, 

none of the implications are equivalent for the singular potential. Using the domain 
X = [—1,1], there are four examples that produces all four possibilities. 


a(x) = 

X 

fK§g|l| 


Z 3 

777-2 = 

1 

3 

904 

525 

92 

9 

1 

7 

777-3 = 

0 

-6796096 

3378375 

-12112 

945 

0 

777-4 = 

1 

5 

4297061248 

287161875 

6938192 

19305 

1 

13 

di - 

2 

45 

765059068785152 

452732233078125 

998000512 

42567525 

-10 

4459 

Coercive 

Yes 

Yes 

Yes 

No 

d 2 = 

16 

15 

624228970176512 

150910744359375 

-618770176 

675675 

-240 

4459 

Single Critical 

Point 

Yes 

Yes 

No 

No 

d 3 = 

4 

45 

-905889167393792 

1358196699234375 

-2998034944 

25540515 

-20 

4459 

Convex 

Yes 

No 

No 

No 


Fig. 1: Table of examples and discriminant values 


As these simple examples show, the Taylor expansion approach to providing a 
polynomial approximation to the free energy will not necessarily work in general, 
and this is perhaps unsurprising given that the Taylor approximation is generally 
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only a local approximation of a function. However, the following Weierstrass-type 
result shows that convex polynomial approximation is possible. 

Proposition 4.6 (From Ref. I27|) . Let M C be convex and compact, and 
f : M —> M be convex and continuous. Then for every e > 0 there exists some 
polynomial p e which is convex on such that 

sup \f(x)-p £ (x)\ < e. 

iEM 

The main limitations of this result however are that the approximation can only 
be performed on compact subsets of Q, and also, much like the classical Weierstrass 
result, the proof is non-constructive. It should also be noted that this result requires 
the function to be only continuous, rather than analytic, so that the Landau theory’s 
assumption of an analytic free energy is unnecessary. One might hope that it is 
possible to establish a necessary and sufficient condition for the fourth order Taylor 
approximation to be convex, but in general the problem of establishing if a given 
polynomial is convex is an NP-hard problem I. Rather than pursuing a polynomial 
approximation, the Yosida-Moreau regularisation may be more appropriate, due to 
its shape preservation properties. 

Definition 4.1. For J > 0, define the Yosida-Moreau regularisation of if s , denoted 
%!) J by 

1> J {b) = mini/> s (&) + ^|&-6| 2 . (4.34) 

beQ 4 

The following proposition outlines several of the key properties which suggest 
that the Yosida-Moreau approximation is an appropriate approximation for the 
singular potential. The results presented are for the general entropy-like objective 
functions and pseudo-Haar constraint functions. 

Proposition 4.7. 

(1) The Yosida-Moreau approximation preserves minima, in the sense that for all 
J > 0, min = ip J ( 0) = Vfs(0) = min ip s . For all b £ R fc , J > 0, the min¬ 
imisation problem defining %f J , as defined in Equation (14.341) . admits a unique 
solution denoted Gj{b). 

(2) For every b £ R fc , 4 >J {b) ^sib), where the limit is infinite for b (jL Q. 

(3) For all J > 0, ip J is a convex, differentiable function. Furthermore, the gradient 
is Lipschitz with Lipschitz constant J. The derivative of can be given in 
terms of Gj(b) by 

^-(b) = J (b— Gj(b)) = A ((Gj(b)). (4.35) 

(4) The Yosida-Moreau approximation can be evaluated numerically by the dual 
problem, 

ip J (b)= max a + X-b— [ <f)*{a + A • a) dp — —| A| 2 . 

aGK,AGlR fc J x ^<7 


(4.36) 
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Proof. The preservation of minima in statement[l]is immediate from the definition, 
by testing 6 = 0. The existence and uniqueness of a minimiser follow from the 
coercivity and strict convexity of the objective function. Statement [2] can be found 
in Ref. EH Statement [3] can be found in Ref. [9j Statement |4] is given in Ref. HElD □ 

Remark 4.6. The previous proposition has several consequences. Firstly, state¬ 
ments 1 and 2 say that i\) J approximates ip s , as well as preserving desirable shape 
properties. Statement 3 gives that is sufficiently regular for first order methods 
to be used. Lastly, statement 4 shows that the dual optimisation problem defining to 
ip J is, at face value, no harder to approach than the optimisation problem defining 

ips- 


4.3.2. The McMillan Model 


As an illustrative and physically meaningful example consider the McMillan model 
for Isotropic-Nematic-Smectic-A phase transitions This is a mean field theory 
as described previously where the state space is X = § 2 x [0,1], and two constraint 
functions are given by 

ai(p,x) =- (3(p- ei) 2 - l) , 
a 2 (p, x) =- (3 {p ■ ei) 2 - l) cosher). 


Here e\ is a unit vector, physically corresponding to the orientation of the material. 
Due to the rotational symmetry of the constraint functions, it is possible to consider 
only state variables ( 9 , x) £ [0,7 r] x [0,1], where cos(0) = p-e\. This approach views 
X as equivalent to a subset of R 2 with measure dp(9,x) = 2n sin(0) d6 dx. The 
constraint functions are analytic and linearly independent, and since the measure 
/i has the same null sets as the Lebesgue measure, the pseudo-Haar condition is 
satisfied. These constraints give two order parameters, denoted 



ai( 0 , x)p(9 , x) dO dx 
a 2 (0, x)p(9, x) dO dx. 


Loosely speaking S corresponds to the degree of orientational order of the molecules, 
and cr represents a coupling between the order of the molecules and the location of 
their centre of mass. If S = a = 0, then the sample is in an isotropic phase. If S' 7 ^ 0 
and a = 0 then it is a nematic phase, and if S ^ 0 and cr ^ 0 then it is a smectic A 
phase. Before any analysis can be performed, the set of physical moments will be 
established. 


Proposition 4.8. The set Q for the constraint functions of the McMillan model is 
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given by 

Q=<jW)eK 2 :Se (-1 l),M<^}. (4.37) 


Proof. To see that the candidate set contains Q, use Proposition 12.11 and test 
against (±1,0) and (— j,±l). The maxima ai and —a\ are 1 and \ respectively, 
which gives that S £ (— l). The maximum of — ^a\(6,x) — 0 , 2 ( 0 , x) can be found 

by noting that 

-iai(6»,x) - a 2 (0,x) (l - 3cos(0) 2 ) ( 1 ±cos(27ra:) ) 
d 21 J (4. 38 ) 

1 4 _ 2 

~2 ' 3 ~ 3’ 

which is attained at 9 = tt and x = 0. For u = (— g, l) the same argument is used. 
To show that Q contains the candidate set is equivalent to showing that the closure 
of the candidate set is a subset of the closure of Q since the sets are convex. To 
prove this, it is then sufficient to show that the four vertices ((1, ±1), (-|,±|)) are 
contained in the closure of Q. This can be done constructively, and here only one 
vertex will be proven with the rest being shown by the same method. Let 0 < e < 1, 
and define the set A e = {(0,x) : cos (0) 2 > 1 — e,cos(27ra;) > 1 — e}. Define 

p e (0,x) = ^27t J sin(0)e?0^ 2it sm.(0)xA e (® 1 x ) ■ (4.39) 


This corresponds to a distribution uniform with respect to the measure on A' on 
A € . Let the corresponding moments be denoted S e ,o e . Then it is immediate that 


1 > >|(3(1 - e) - 1) = 1 - |e 

1 > 0 -e > ^( 3 ( 1 - e ) - 1 )( 1 - e ) = 1 - \ e + 


(4.40) 


Therefore ( S e ,o e ) £ Q, and by taking e to 0, (1,1) £ Q. □ 


□ 


Using the formula given in Appendix |Appendix A| the exact symbolic integra¬ 
tion package in Maple gives the fourth order Taylor approximation to the singular 
potential as 


/ 4 / c w 425 4 50 2 2 825 
^ sV ’ ' 196 49 196 


(7 — 


21 


-u u ~r 






Proposition 4.9. For the McMillan model, ip^ is coercive, so that 

, 1 ™ ipt(S,cr) =+00. 


Proof. It is sufficient to show that the fourth order terms are coercive. This can 
be written as a bilinear form in S 2 and cr 2 , since 


425 

196' 


S 4 


50 2 2 825 2 

49 a ^ + 196 fT 


~s 2 ' 


’425 

25 ‘ 


's 2 ' 



196 

49 



_2 


25 

825 


_2 

cr 


L 49 

196 J 


cr 
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The eigenvalues of the matrix are given by f|§ ± |§-\/5, which are both positive 
(evaluating at approximately 2.04 and 4.33), so that the leading order terms are 
positive. □ 


Proposition 4.10. In the McMillan model, ip* is not a convex function of S, a, 
even when restricted to Q. 


Proof. It is sufficient to show that the Hessian matrix of ip* has a negative eigen¬ 
value for some (S,a). The Hessian matrix is readily computed as 


1 

r12755' 2 + IOOct 2 - 3505 + 245 

2005cr - 700 ct 

H(S ’^ =49 

2005cr - 700 ct 1005 2 + 2475cr 2 - 7005 + 490 


fl275- 350 + 245 0 


=*■ H (1,0) =— 

0 

100 - 700 + 490 


1 

'1170 0 



49 

0 -110 




Therefore it has a negative eigenvalue, and ip* is not convex in this case. In partic¬ 
ular, it is not convex in a neighbourhood of (1, 0), since the Hessian is continuous. 
By noting that the intersection of any neighbourhood of (1,0) with Q is non-empty 
since (1,0) £ dQ , this implies that ip* is not convex on Q. □ 


To conclude, contour plots of of the singular potential and its approximations 
are given in Figure [2] The singular potential itself is given in Subfigure [5a] It should 
be noted that ip s is only given on {(/S', a) £ Q : d ((5, a), dQ) > 10~ 2 } to avoid the 
difficulty in calculating the singular potential near dQ. The fourth order Taylor 
approximation is given in Subfigure I2bl Also included on the plot is a white dashed 
line, which is the boundary of the set where the Hessian matrix has a negative 
eigenvalue. The Yosida-Moreau approximation for J = 100 is given inSubfigurel2cl 
The numerical values for ip s and ip J were obtained via a steepest descent algorithm 
on the dual optimisation problem. 
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Fig. 2: Contour plots of the singular potential and its approximations. 


Appendix A. Appendix: The Fourth Order Taylor Expansion for 
Shannon Entropy 

For this section consider the pseudo-Haar constraint functions (a,;)f =1 to be orthog¬ 
onal to the constant function so that f x at dp = 0, but not necessarily orthogonal 
to each other. Furthermore the in this section the convention that Greek indices 
are summed over, whilst Latin indices are free, is used. Given neN and p £ V{X) 
define the tensor M n £ (K. fe ) n component-wise by 

M h.i 2 ...i n (p)= a il a i 2 ...a in pdp. (A.l) 

Jx 

for ij = 1,.., k, j = 1, ..n,. Consider the function b H > M n (pt„), which when unam¬ 
biguous will simply be denoted M n , with pi, maximal entropy under the Shannon 
entropy <j){x ) = a:ln;r. This allows any maximal entropy p £ V{X) to be written as 

p{t) = ^ exp (A • a{t)) . (A.2) 

The map b K > M n (pb) is differentiable, so by applying the chain rule the derivative 
can be found as 

dbi n - |_i &bi n + i 

= (^j a ii a i2-'- a in a aPb dp — ((-Af — b (g) b) ) a j n+1 

= M? li2 ... in b a )V ain+1 

(A.3) 
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where for brevity V = (M 2 — 6 (g> 6) 1 . That the derivatives can be seen to satisfy 
this by applying Propositions [373] and [3/4] with some elementary calculus. From here 
it is now possible to perform a Taylor expansion of ip s to low order. Immediately 
it holds that ip s (0) = — ln(/z(AT)). For the first derivative it was shown in 

Proposition 13.41 that 


W (6) = Ai - (A ' 4) 

Using the strict convexity of </>, it is immediate that the uniform distribution is the 
global entropy minimiser of V(X), so that in particular = pb(t) for 6 = 0 

and almost every t £ X. By rewriting the uniform distribution on A' as = 

((/> , ) _1 (a + A • a(t)) for A = 0 and a = (</>') (/Lt(A') -1 ) demonstrates that A(0) = 0, 
so that 




Furthermore this gives that 6 = 0 is always a critical point for ip s and for any 
truncated Taylor expansion of 'i/’s about 6 = 0 with order greater than or equal to 
1. For the second derivative it has been seen in Prop. [33] that 


d 2 ^ s 

db Zl db l2 


(6) = ((M 2 (6) 



ay, 

db n db, i2 


(0)=((M 2 (0)) ') 

\ / 1x12 


(A.5) 


As seen before, this is necessarily a positive definite matrix, and consequently 6 = 0 
is always a local minimum for x/j s and any Taylor expansion of order greater than or 
equal to 2. For the third derivative first note that the derivative of V will have to be 
taken and since this involves a matrix inverse a more complex expression appears, 


dV ili2 T/ T r dM 2 ia2 — b ai b a2 

56,3 iiai i2 “ 2 56,3 

= — VcniiVot^Votziz {Ma 1 a 2 a 3 ~ ^«l-^a 2 ct 3 — ^2^0103 — ^3^aia 2 ) • 

(A.6) 


This gives that the third derivative at 6 = 0 can be given by 

d 3 i/j s 


db^db^db^ 


(0) = - W0)W0)W0)A£ iaaa3 (0). 


(A.7) 


Since the Taylor expansion will only be taken to fourth order, and the expansion 
is to be performed around 6 = 0, all terms with factors of 6 will be denoted as Ab, 
which can later be neglected in calculating the fourth derivative of ips at 6 = 0. It 
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can be given as 

-Jfj QIj . QJj . Vilfil ^2Q!2 ^ 30:3 V 4 0 3 Vcxi02^aiOt2CX3^010203 

“1“ Viiai V i2 ^ V i3Ci3 Vi 4 0 3 Vcx 2 02 ^010203 ^010203 
V iiai Vi 3 Qi2 ^ 3/31 V 4 0 3 Vq,302 Mq Li cx2CX3 ^010203 

(ViiOii Vi 2 a 2 Vi 30i3 Vi 40L4 ) 


Ab. 


CXI&2OI3CX4 


- M 2 M 2 - M 2 M 2 - M 2 M 2 ) 

ex±ex2 CX3CX4 aia3 Q2Q4 0:10:4 0203/ 


(A.8) 

At the isotropic state it must hold that Ab = 0, so that the fourth derivative at 
b = 0 is given by the remaining terms. 
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